[Pilot study of domain-specific terminology adaptation for morphological analysis: research on unknown terms in national examination documents of radiological technologists].

نویسندگان

  • Shintarou Tsuji
  • Naoki Nishimoto
  • Katsuhiko Ogasawara
چکیده

Although large medical texts are stored in electronic format, they are seldom reused because of the difficulty of processing narrative texts by computer. Morphological analysis is a key technology for extracting medical terms correctly and automatically. This process parses a sentence into its smallest unit, the morpheme. Phrases consisting of two or more technical terms, however, cause morphological analysis software to fail in parsing the sentence and output unprocessed terms as "unknown words." The purpose of this study was to reduce the number of unknown words in medical narrative text processing. The results of parsing the text with additional dictionaries were compared with the analysis of the number of unknown words in the national examination for radiologists. The ratio of unknown words was reduced 1.0% to 0.36% by adding terminologies of radiological technology, MeSH, and ICD-10 labels. The terminology of radiological technology was the most effective resource, being reduced by 0.62%. This result clearly showed the necessity of additional dictionary selection and trends in unknown words. The potential for this investigation is to make available a large body of clinical information that would otherwise be inaccessible for applications other than manual health care review by personnel.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی تطبیقی اصطلاح‌نامه معارف اسلامی و علوم قرآنی

This study examines the comparative strengths and weaknesses of the thesaurus and thesaurus Quranic teachings of the Koran. In today's society where the documents are kept electronically, retrieval and dissemination of information for the development of research, much greater importance of saving documents and thesaurus that is the basis for indexing in various sciences, One of the solutions fo...

متن کامل

Political Terms by APLL: Issues of Terminology Implantation and ‎Acceptability

The present study investigates the implantation of political science terminology approved by the Academy of Persian Language and Literature (APLL) in the Hamshahri corpus made up of news text from Hamshahri newspaper and their acceptability among MA students of English translation studies (ETS), English literature (EL), and Political science (PS). To conduct this research the frequencies of the...

متن کامل

A Theoretical Review of Disaster’s Social Terminology

Background and Aim: Using common terms and synonyms, such as concepts related to natural crisis managements, is one of the major challenges in interdisciplinary topics; which are apparently similar but have different meanings and practical implications. “Natural disaster” is one of these terms. The main purpose of this study is to describe the common and professional term in science and creatin...

متن کامل

Radiation-related neoplasms, circulatory diseases, and cataracts among radiological technologists

Background: In response to the need for diagnosis and treatment, medical radiation has been increasingly used worldwide. This study investigated the medical utilization of radiation-related diseases among radiological technologists (RTs) and factors that influence such diseases. Materials and Methods: Data were collected from the Taiwan National Health Insurance Research Database. A panel study...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nihon Hoshasen Gijutsu Gakkai zasshi

دوره 64 7  شماره 

صفحات  -

تاریخ انتشار 2008